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Abstract 

A key step in many perceptual decision tasks is the integration of sensory inputs over time, 
but fundamental questions remain about how this is accomplished in neural circuits. One 
possibility is to balance decay modes of membranes and synapses with recurrent excitation. To 
allow integration over long timescales, however, this balance must be precise; this is known as the 
fine tuning problem. The need for fine tuning can be overcome via a ratchet-like mechanism, in 
which momentary inputs must be above a preset limit to be registered by the circuit. The degree 
of this ratcheting embodies a tradeoff between sensitivity to the input stream and robustness 
against parameter mistuning. 

The goal of our study is to analyze the consequences of this tradeoff for decision making 
performance. For concreteness, we focus on the well- studied random dot motion discrimination 
task. For stimulus parameters constrained by experimental data, we find that loss of sensitivity 
to inputs has surprisingly little cost for decision performance. This leads robust integrators 
to performance gains when feedback becomes mistuned. Moreover, we find that substantially 
robust and mistuned integrator models remain consistent with chronometric and accuracy func- 
tions found in experiments. We explain our findings via sequential analysis of the momentary 
and integrated signals, and discuss their implication: robust integrators may be surprisingly 
well-suited to subserve the basic function of evidence integration in many cognitive tasks. 

1 Introduction 

Many decisions are based on the balance of evidence that arrives at different points in time. This 
process is quantified via simple perceptual discrimination tasks, in which the momentary value of a 
sensory signal carries negligible evidence but correct responses arise from summation of this signal 
over the duration of a trial. At the core of such decision making must lie neural mechanisms that 
integrate signals over time (Gold and Shadlen, 2007; Wang, 2008; Bogacz et al., 2006). The function 
of these circuits is intriguing, because perceptual decisions develop over hundreds of milliseconds 
to seconds, while individual neuronal and synaptic activity often decays on timescales of several 
to tens of milliseconds - a difference of at least an order of magnitude. A mechanism that bridges 
this gap is feedback connectivity tuned to balance - and hence cancel - inherent voltage leak and 
synaptic decay (Cannon et al., 1983; Usher and McClelland, 2001). 

The tuning of recurrent connections to achieve this balance presents a challenge (Seung, 1996; 
Seung et al., 2000), illustrated in Figure 4(A) via motion of a ball on an energy surface. Here, 
the ball position E{t) represents the total activity of a circuit (relative to a baseline marked 0); 
momentary sensory input perturbs E{t) to increase or decrease. If decay dominates (upper-right), 
then E{t) always has a tendency to "roll back" to baseline values, thus forgetting accumulated 
sensory input. Conversely, if feedback connections are in excess, then activity will grow away 
from the baseline value (center). If balance is perfectly achieved via fine-tuning, (left) temporal 
integration can occur. That is, inputs can then smoothly perturb network activity back and forth, 
so that the network state at any given time represents the time-integral of past inputs. 

Koulakov et al. (2002) proposed an alternate model: a ratchet-like accumulator, equivalent to 
movement along a scalloped energy surface (Figure 4(A), bottom) (Pouget and Latham, 2002). 
Importantly, even without finely-tuned connectivity, network states can hold prior values without 
decay or growth, allowing integration of inputs over time. Thus, this mechanism is called a robust 
integrator. Energy wells can be spaced arbitrarily close together while maintaining their depth, 
so that the robust integrator can represent a practically continuous range of values. However, the 
energy wells imply a minimum input strength to transition between adjacent states, with inputs 
below this limit effectively ignored. 
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Figure 1: Schematic of neural integrator models. (A) Visualizing integration via an energy surface 
(Pouget and Latham, 2002). A robust integrator can "fixate" at a range of discrete values, indicated 
by a sequence of potential wells, despite mistuning of circuit feedback. Without these wells (the 
non-robust case), activity in a mistuned integrator would either exponentially grow or decay, as 
in the top panels. Perturbing the robust integrator from one well to the next, however, requires 
sufficiently strong momentary input. (B) As a consequence, low- amplitude segments in the input 
signal A/(t), below a robustness limit i?, are not accumulated by a robust integrator: only the high- 
amplitude segments are. The piecewise-defined differential Equation (5) captures this robustness 
behavior, resulting in the accumulated activity shown, and may be related to, e.g., a detailed 
bistable-subpopulation model. A decision is expressed when the accumulated value E{t) crosses 
the decision threshold 9. 

The two models just introduced present a tradeoff between robustness to parameter mistun- 
ing and sensitivity to inputs. Here, we ask how this tradeoff impacts behavioral performance in 
perceptual decision making. Focusing on the moving dots task (Shadlen and Newsome, 1996; 
Roitman and Shadlen, 2002), enables us to constrain model parameters to known physiology and 
behavior. Our aim is to establish whether or not the robust integrator model is consistent with 
known data, and to assess the performance benefits, if any, that it affords when network parameters 
cannot be fine-tuned. 

2 Materials and methods 
2.1 Model and task overview 

To explore the consequences of the robust integrator mechanism for decision performance, we begin 
by constructing a two-alternative decision making model similar to that proposed by Mazurek et 
al. (2003). For concreteness, we concentrate on the forced choice motion discrimination task (Roit- 
man and Shadlen, 2002; Mazurek et al., 2003; Gold and Shadlen, 2007; Churchland et al., 2008; 
Shadlen and Newsome, 1996; Shadlen and Newsome, 2001). Here, subjects are presented with a 
field of random dots, of which a subset move coherently in one direction; the remainder are relo- 
cated randomly in each frame. The task is to correctly choose the direction of coherent motion 
from two alternatives (i.e., left vs. right). 

As in Mazurek et al. (2003) (see also Smith (2010)), we first simulate a population of neurons 
that represent the sensory input to be integrated over time. This population is a rough model of 
cells in extrastriate cortex (Area MT) which encode momentary information about motion direc- 
tion (Britten et al., 1993; Britten et al., 1992; Salzman et al., 1992). We pool spikes from model 
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Sensory Neurons: Input Signals: Recurrent Integration: Threshold: 




Figure 2: Overview of model setup. Simulations of sensory neurons and neural recordings are used 
to define the left and right inputs A//(t), A/^(t) to neural integrators (see text). These inputs are 
modeled by gaussian (OU) processes, which capture noise in the encoding of the motion strength 
by each pool of spiking neurons. See Equations (l)-(3) for definition of input signals. Similar to 
Mazurek et al. (2003), the activity levels of the left and right integrators Ei{t) and Er{t) encode 
accumulated evidence for each alternative. In the reaction time task, Ei{t) and Er{t) race to 
thresholds in order to determine choice on each trial. In the controlled duration task the choice is 
made in favor of the integrator with higher activity at the end of the stimulus presentation. 

MT cells that are selective for each of the two possible directions into separate streams, labeled 
according to their preferred "left" and "right" motion selectivity: see Figure 2. 

Two corresponding integrators then accumulate the difference between these streams, left-less- 
right or vice- versa. Each integrator therefore accumulates the evidence for one alternative over the 
other. Depending on the task paradigm, different criteria may be used to terminate accumulation 
and give a decision. In the reaction time task, accumulation continues until activity crosses a 
decision threshold: if the leftward evidence integrator reaches threshold first, a decision that overall 
motion favored the leftward alternative is registered. 

Accuracy is defined as the fraction of trials that reach a correct decision. Speed is measured 
by the time taken to cross threshold starting from stimulus onset. Reaction Time (RT) is then 
defined as the time until threshold (decision time) plus 350 ms of non-decision time, accounting 
for other delays that add to the time taken to select an alternative (e.g. visual latencies, or motor 
preparation time, cf. (Mazurek et al., 2003; Luce, 1986)). The exact value of this parameter was 
not critical to our results. Task difficulty is determined by the fraction of coherently moving dots 
C (Britten et al., 1992; Mazurek et al., 2003; Roitman and Shadlen, 2002). Accuracy and RT across 
multiple levels of task difficulty define the accuracy and chronometric functions in the reaction time 
task, and together can be used to assess model performance. When necessary, these two numbers 
can be collapsed into a single metric, such as the reward per unit time or reward rate. 

In a second task paradigm, the controlled duration task, motion viewing duration is set in 
advance by the experimenter. A choice is made in favor of the integrator with greater activity 
at the end of the stimulus duration. Here, the only measure of task performance is the accuracy 
function. 
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Figure 3: Construction of gaussian (OU) processes to represent fluctuating, trial-by-trial firing rate 
of a pool of weakly correlated MT neurons (Bair et al., 2001; Zohary et al., 1994). As in Mazurek 
and Shadlen (2002), these motion sensitive neurons provide direct input to our model integrator 
circuits. Simulated spike trains from weakly-correlated, direction selective pools of neurons are 
shown as a rastergram. All spikes prior to time t - a sum over the j^^ spike from the i^^ neuron, 
for all i and j - are convolved with an exponential filter, and then summed to create a continuous 
stochastic output (right); here, H{t) is the Heaviside function. We approximated this output by 
a simpler gaussian (OU) process in order to simplify numerical and analytical computations that 
follow. 

2.2 Sensory input 

We now describe in detail the signals that are accumulated by the integrators corresponding to 
the "left" and "right" alternatives. First, we model the pools of leftward or rightward direction- 
selective sensory (MT) neurons as = 100 weakly correlated spiking cells (Pearson's correlation 
p — .11 (Zohary et al., 1994; Bair et al., 2001)); see Figure 3. Specifically, as in Mazurek and 
Shadlen (2002), each neuron is modeled via an unbiased random walk to a spiking threshold; the 
random walks of neurons in the same pool are correlated. Increasing the variance of each step in 
the random walk increases the firing rate of each model neuron; it was therefore chosen at each 
coherence value to reproduce the linear relationship between coherence C and mean firing rate /i/^-^ 
of the left and right selective neurons observed in MT recordings: 

/i^,r(C) = ro + 6;,^C . (1) 

Here the parameters ro, 6/, and hj. are derived from firing rates observed across a range of coher- 
ences (Britten et al., 1993). If evidence favors the left alternative, hi — .4 and hr — —.2; if the right 
alternative is favored, these values are exchanged. 

Next, the output of each spiking pool was aggregated. Each spike emitted from a neuron in 
the pool was convolved with an exponential filter with time constant 20 ms. This is intended as 
an approximate model of the smoothing effect of synaptic transmission. These smoothed responses 
were then summed to form a single stochastic process for each pool (see Figure 3, right). 

We then approximated the smoothed output of each spiking pool by a simpler stochastic process 
that captures the mean, variance, and temporal correlation of this output as a function of dot 
coherence. We used gaussian processes Ii{t) and Ir{t) for the rightward- and leftward-selective pools 
(See Figure 3). Specifically, we chose Ornstein-Uhlenbeck (OU) processes, which are continuous 
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gaussian process generated by the stochastic differential equations 



d/,. = '"-"^>-"'- dt + ./^diy. (2) 

' T V T 

with mean fii^^iC) as dictated by Equation 1. The variance i^i^riC) and timescale r were chosen to 
match the steady-state variance and autocorrelation function of the smoothed spiking process. As 
we will see, this timescale plays an important role in determining the decision making performance 
of robust integrators. 

Our construction so far accounts for variability in output from left vs. right direction selective 
neurons. We now incorporate an additional noise source into the output of each pool. These noise 
terms {rii{t) and rir{t)^ respectively) could approximate, for example, neurons added to each pool 
that are nonselective to direction. Each noise source is modeled as an independent OU process with 
mean 0, timescale 20 ms as above, and a strength (variance) This noise strength is a free 

parameter that we vary to match behavioral data (see "A robust integrator circuit" and Figure 14). 
We note that previous studies also found that performance based on the direction-sensitive cells 
alone can be more accurate than behavior, and therefore incorporated variability in addition to the 
output of "left" and "right" direction selective MT cells (Shadlen et al., 1996; Mazurek et al., 2003; 
Cohen and Newsome, 2009). 

Finally, the signals that are accumulated by the left and right neural integrators are constructed 
by differencing the outputs of the two neural pools: 

/\Ii{t) = [Ii{t) + r^i[t)]-[I^[t)+r]r{t)] 

AIr{t) = -AIi{t) . (3) 



2.3 Neural integrator circuit and feedback mistuning 

A central focus of our paper is variability in the relative tuning of recurrent feedback vs. decay 
in an integrator circuit. Below, we will introduce the mistuning parameter /3, which determines 
the extent to which feedback and decay fail to perfectly balance. We first define the dynamics of 
the integrator circuit on which our studies are based. This is described by the firing rates Ei^r{t) 
of integrators that receive outputs from left-selective or right-selective pools A//^^(t) respectively. 
The firing rates Ei^j.{t) increase as evidence for the corresponding task alternative is accumulated 
over time: 

= -Ei,r + (1 + + I^^IlAt)- (4) 

The three terms in this equation account for leak, feedback excitation, and the sensory input (scaled 
by a weight k)^ respectively. When the mistuning parameter /? = 0, leak and self-excitation exactly 
cancel; we describe such an integrator as perfectly tuned^ while an integrator with /3 7^ is said 
to be mistuned. Imprecise feedback tuning is modeled by randomly setting /3 to different values 
from trial to trial (but constant during a given trial), with a mean value ^ and a precision given 
by a standard deviation cr^. We assume that ^5 = for most of the study. Thus the spread of /?, 
which we take to be gaussian, represents the intrinsic variability in the balance between circuit-level 
feedback and decay. Perfect tuning corresponds to = ;5 = 0, while ^ oi ^ corresponds 
to a mistuned integrator. Finally, we set initial activity in the integrators to zero (Ei^r(0) = 0), 
and impose reflecting boundaries at Er = 0^ Ei = (as in, e.g., Smith and Ratcliff (2004)) so that 
firing rates never become negative. 
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2.4 A robust integrator circuit 



A robust integrator can be constructed from a series of bistable subpopulations, which sequentially 
activate in order to represent the accumulated evidence (Koulakov et al., 2002; Nikitchenko and 
Koulakov, 2008). The many equations that describe the evolution of these systems can be closely 
approximated with reduced models, as demonstrated in Goldman et al. (2003). We derived a 
single piecewise-defined differential equation model that approximates the dynamics of a robust 
integrator constructed from bistable pools. 

All subsequent results are based on this simplified model, which captures the essence of the 
robust integration computation: 

dt \ f3Ei^r + i^AIi,r • otherwise ^ ^ 

The first line represents the series of potential wells discussed in the Introduction (see Figure 4): if 
the sum of the mistuned integrator feedback and the input falls below the robustness limit i?, the 
activity of the integrator remains fixed. If this summed input exceeds i?, the activity evolves as for 
the non-robust integrator in Equation (4). To interpret the robustness limit i?, it is convenient to 
normalize by the standard deviation of the input signal: 

R 

STD [Mi,r{t)Y 

In this way, R can be interpreted in units of standard deviations of input OU process that are 
"ignored" by the integrator. We note that Equation (5) is similar to the effective equation derived 
for a different implementation of a robust integrator (Goldman et al., 2003). 

To summarize. Equation (5) defines a parameterized family of neural integrators, distinguished 
by the robustness limit R. As R 0, the model reduces to Equation (4). When additionally /3 = 0, 
the (perfectly tuned) integrator computes an exact integral of its input: Equation (5) then yields 

Ei^rit) OC /\Ii^r{t')dt' . 



2.5 Computational methods 

Monte Carlo simulations of Equations (l)-(5) were performed with Euler-Maruyama method (Higham, 
2001), with dt = Q.l ms. For a fixed choice of input statistics and threshold ^, a minimum of 10, 000 
trials were simulated to estimate accuracy and RT values. During simulations of the reaction time 
task, in order to prevent excessively long trials (particularly at low coherence values) a maximum 
simulation time was set at 10,000 ms. At this time, if neither integrator had reached threshold, the 
indeterminate result was broken by a numerical "coin flip" , (this rarely occurred, as indicated by 
the RT histograms in Figures 15, 16). 

In simulations where cr/3 > 0, results were generated across a range of /3 values and then 
marginalized by weighting according to a normal distribution. The range of values was chosen with 
no less than 19 linearly spaced points, across a range of ± 3 standard deviations around the mean 

^. 

Reward rate values presented in "Reward rate and the robustness-sensitivity tradeoff' are pre- 
sented as maximized by varying the free parameter 9] values were computed by simulating across a 
range of 6 values. The range and spacing of these values were chosen dependent on the values of R 
and /? for the simulation; the range was adjusted to capture the relative maximum of reward rate 
as a function of ^, while the spacing was adjusted to find the optimal 6 value with a resolution of 
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Figure 4: Parameter space view of four integrator models, with different values of the robustness 
limit R and feedback mistuning cr^. The impact of transitioning from one model to another by 
changing parameters is either to enhance or diminish performance, or to have a neutral effect (see 
text). 

± 0.1. 

Values of 9 and z/^ in the table included in Figure 14 were chosen to best match accuracy and 
chronometric functions to behavioral data reported in Roitman and Shadlen (2002). This was 
accomplished by minimizing the sum-squared error in data vs. model accuracy and chronometric 
curves across a discrete grid of 6 and values, with a resolution of 0.1. When data between 
simulated values were needed, linear interpolation was used to approximate the corresponding 
accuracy and RT values. 

Autocovariance functions of integrator input, presented later, were computed by simulating an 
Ornstein-Uhlenbeck process using the exact numerical technique in Gillespie (1996) with dt = 0.1 
ms, to obtain a total of 2^^ sample values. Sample values of the process less than the specified 
robustness limit R were set to 0, and the autocovariance function was computed using standard 
Fourier transform techniques. 

Simulations were performed on NSF Teragrid clusters. 

3 Results 

3.1 How do robustness and mistuning affect decision speed and accuracy? 

In the Methods, we define a general neural integrator model (Equation (5)) that accumulates signals 
representing the output of motion sensitive neurons (Equation 3). The integrator model includes 
two key parameters. The first is /?, which represents the mistuning of feedback from a value that 
perfectly balances decay; the extent of this mistuning is measured by a^, the standard deviation 
of /3 from the ideal value /3 = 0. The second is the robustness limit R. We emphasize twin 
effects of R: as R increases, the integrator becomes able to produce a range of graded persistent 
activity for ever- increasing levels of mistuning (see Figure 4 (A), where R corresponds to the 
depth of energy wells). This prevents runaway increase or decay of activity when integrators are 
mistuned; intuitively, this might lead to better performance on sensory accumulation tasks. At the 
same time, as R increases integrator activity remains fixed even for increasingly strong positive or 
negative momentary input A// ,^ (see Figure 4 (B), where R specifies a limit within which inputs 
are "ignored"). Such sensitivity loss should lead to worse performance. This implies a fundamental 
tradeoff between competing effects: (1) one would prefer to not ignore relevant input stimulus. 
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Figure 5: Mistuned feedback diminishes decision performance. (Inset) In both figures we consider a 
move in parameter space from the "basehne" model to the "mistuned" model by changing = ^ 
0.1 (A) In the controlled duration task, accuracy is lower for the "mistuned" model (dashed line) 
than for the "baseline" model (solid line) at every trial duration T, indicating a loss of performance 
when increases. (B) In the reaction time task, we plot the curve of all (RT, accuracy) pairs 
attained by varying the decision threshold 9 (see text). Once again, accuracy is diminished by 
mistuning. 



favoring small i?, and (2) one would prefer an integrator robust to mistuning, favoring large R. 

Figure 4 gives a schematic of how the two model parameters, and i?, define a plane of possible 
integrator models. Here, we explore decision performance in four different cases arranged in this 
plane. By contrasting integrators with different values of the robustness limit i?, we can assess how 
the fundamental tradeoff plays out, to either improve or degrade decision making performance. 

In order to assess this performance, we consider relationships between decision speed and accu- 
racy in both controlled duration and reaction time tasks. In the controlled duration task, we simply 
vary the stimulus presentation duration, and plot accuracy vs. experimenter-controlled stimulus 
duration. In the reaction time task, we vary the decision threshold 9 — treated as a free parameter 
— over a range of values, thus tracing out the speed accuracy curve for all possible pairs of speed 
and accuracy values. Here, speed is measured by reaction time (RT), the latency between the onset 
of stimulus and crossing of the decision threshold. For both cases, we use a single representative 
dot coherence (C=12.8 in Equation 1); results are qualitatively similar for other coherence values 
(data not shown); slightly (approx. 25%) lower robustness limits are required at the lowest dot 
coherence of C = 3.2. 

We first study a case we call the "baseline" model, for which there is no mistuning or robustness: 
cr^ = ^ = 0. Speed accuracy plots for this model are shown as a sohd hne in Figs. 5(A) and (B), 
for the controlled duration and reaction time tasks respectively. We compare the "baseline" model 
with the "mistuned" model, for which the feedback parameter has a standard deviation of = 0.1 
(10% of the mean feedback) and robustness i? = remains unchanged. In the controlled duration 
task (Figure 5(A)) we observe that mistuning diminishes accuracy by as much as 10%, and this 
effect is sustained even for arbitrarily long viewing windows (cf. (Usher and McClelland, 2001; 
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Figure 6: Increasing the robustness limit R helps recover performance lost due to feedback mis- 
tuning. (Inset) We illustrate this by moving in parameter space from the "mistuned" model to 
the "recovery" model, by changing i? = ^ .85. The impact on decision performance is shown 
for both the controlled duration (A) and reaction time (B) tasks. For each task we plot the rela- 
tionship between speed and accuracy as above: solid lines indicate the "baseline" model, dotted 
the "mistuned" model, and now dash-dotted the "recovery" model. We find that R > yields a 
modest performance gain for the "recovery" model in comparison with the "mistuned" model. 



Bogacz et al., 2006)). The reaction time task (Panel B) produces a similar effect: for a fixed RT, 
the corresponding accuracy is decreased. 

Next we increase the robustness limit to = 0.85 — so that almost ± a standard deviation of 
the input stream is "ignored" by the integrators — while maintaining feedback mistuning. We call 
this case the "recovery" model because robustness compensates in part for the performance loss due 
to feedback mistuning: the speed accuracy plots in Figure 6 for the recovery case lie above those 
for the "mistuned" model. For example, at the longer controlled task durations (Panel A) and 
reaction times (Panel B) plotted, 20% of the accuracy lost due to integrator mistuning is recovered 
via the robustness limit R = 0.85. 

Finally, we study the remaining possibility, when the robustness parameter R is increased from 
zero in a perfectly tuned integrator (cr/3 = 0); this is the "robust" case in Figure 4. We expected 
performance to be substantially diminished as a consequence of lost sensitivity to inputs. However, 
Figure 7 demonstrates that this is not the case: speed accuracy curves for R = 0.85 almost coincide 
with those for the "baseline" case of i? = 0. We emphasize again that because R measures ignored 
input in units of the standard deviation, the integrator circuit is actually not integrating the weakest 
60% of the input stimulus. Given this large amount of ignored stimulus, the fact that the robust 
integrator produces nearly the same accuracy and speed as the "baseline" case is surprising. This 
implies that the "robust" model can protect against feedback mistuning, without substantially 
sacrificing performance when feedback is perfectly tuned. 

To summarize, the ratchet-like mechanism of the robust integrator appears well-suited to the 
decision tasks at hand. This mechanism counteracts some of the performance lost when feedback 
fails to be perfectly fine-tuned. Moreover, even when this fine-tuning is achieved, a robust integrator 
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Figure 7: Increasing R alone does not compromise performance. (Inset) We illustrate this by 
moving in parameter space from the "baseline" to the "robust" model. For both (A) controlled 
duration and (B) reaction time tasks, we plot the relationship between speed and accuracy. Solid 
hues give results for the "baseline" model, and dash-dotted for the robust model. The curves are 
very similar in the "baseline" and robust cases, indicating little change in decision performance due 
to the robustness limit ^ = 0.85. 



still performs as well as the "baseline" case that is perfectly sensitive to the input signal. In the 
next section, we begin to explain this observation by constructing several simplified models and 
employing results from statistical decision making theory. 

3.2 Analysis: Robust integrators and decision performance 
3.2.1 Controlled duration task: Discrete time analysis 

We can begin to understand the effect of the robustness limit on decision performance by formu- 
lating a simplified version of the evidence accumulation process. We focus first on the controlled 
duration task, where the analysis is somewhat simpler. 

Our first simplification is to consider a single accumulator E which receives evidence for or 
against a task alternative in discrete time. The value of E on the i^^ time step, Ei, is allowed 
to be either positive or negative, corresponding to accumulated evidence favoring the leftward or 
rightward alternatives, respectively. On each time step, Ei increments by an independent, random 
value Zi with a probability density function (PDF) fz{Z). We first describe an analog of the 
"baseline" model above; i.e., in the absence of robustness (i? = 0). Here, we take the increments 
Zi to be independent, identical, and normally distributed, with a mean fi > (i.e. biased toward 
the leftward alternative; we call this the preferred alternative) and standard deviation a: that is, 
Zi ^ N (/X, cr^). After the v}^ step, we have 

n 

En ^ Zi . 

i=l 
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Figure 8: The effect of i? on a discrete time increment distribution, and the second real root of 
the moment generating function of this distribution. (A) The PDF of the random variable Z^, 
with probability mass for values between the robustness limit R re-allocated as a delta function 
centered at zero = 1). (B) The second real root /iq of Mz^{s) remains unchanged as R increases 
from 0^2. (Lines are uniformly distributed in this range.) This implies that in the reaction time 
task, no changes in the accuracy and chronometric functions will be observed until the deviation 
in E[Zb\ becomes large (Discussed in "Reaction time task: Continuous analysis"). 



In the controlled duration task, a decision is rendered after a fixed number of time steps N ^ (i.e. 
n — N) and a correct decision (i.e., in favor of the preferred alternative) occurs when En > 0. By 
construction, ^ N (nfi, na^)^ which implies that accuracy (Ace) can be computed as a function 
of the signal-to-noise ratio (SNR) 5 = ^ of a sample: 

l + Erff./^5 

Acc / e 2iva2 dx ^ ^ . (7) 

Jo V2^A^ 2 

Next, we change the distribution of the accumulated increments Zi to construct a discrete 
time analog of the robust integrator. Specifically, increasing the robustness parameter to i? > 
affects increments Zi by redefining the PDF fz{Z) so that weak samples do not add to the total 
accumulated "evidence", precisely as in Equation (5). (Models where such a central "region of 
uncertainty" of the sampling distribution is ignored have previously been studied in a race-to- 
bound model (Smith and Vickers, 1989); see Discussion). This requires reallocating probability 
mass below the robustness limit to zero. We plot the resulting PDF in Figure 8(A), where the 
reallocated mass gives a weighted delta function at zero. Specifically: 

(Z) = * (0) £ /. (z-) iz' + { ; Xl< (8) 

The central limit theorem then allows us to approximate the new cumulative sum E^^ as 
a normal distribution (for sufficiently large A^), with and a in Equation (7) replaced by the 
mean and standard deviation of the PDF defined by Equation (8). As before, we normalize R 
by the standard deviation of the increment, R — and then express the fraction correct Acc^ 
as a function of R and s. One can think of R as perturbing the original Acc function (Equation 
(7)), and although this perturbation has a complicated form, we can understand its behavior by 
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Figure 9: Comparison of the decision accuracy predicted by discrete and continuous time approxi- 
mate models for the controhed duration task (as above, coherence C = 12.8). (A) Both discrete and 
continuous time models predict that accuracy {Acc) will stay roughly constant as R increases up 
to ^ 0.5, followed by a gradual decrease. However, the continuous time approximation provides a 
closer match to results for the full model pictured in Figure 7(A) (see text). Additionally, compar- 
ing the dashed and dot-dashed lines shows that the approximation in Equation 9 provides a good 
description of the discrete time model for < 1. (B) The (numerically computed) autocovariance 
functions of the input signal Zf^{t) at various levels of R that were used to construct the continuous 
time curve in (A) (Equation (9)). (Inset) Two of these same autocovariance functions (for R = 
and = 1.5) are plotted normalized to their peak value. This shows that autocorrelation falls off 
faster as R increases. 



observing that its Taylor expansion has only one nonzero term up to fifth order in R: 

Acc^{N) ^ Acc{N){l - P{N)) (9) 

P{N) = ^ R^ + 0{R^) 

OTT 

Thus, for small values of R (giving very small R^), there will be little impact on accuracy. 

Equation (9) can therefore partially explain the key observation in Fig 7(A) that R can be 
increased to 0.85 while incurring very little performance loss. For concreteness, we focus on decisions 
at T = 500 ms. To make a rough comparison, we first assume that a new sample of evidence arrives 
in the discrete time model every 10 ms. We then set the SNR s so that Acc = 0.92 for the discrete 
model when = 50, agreeing with the accuracy obtained from the continuous model at 500 
ms (Fig 7(A)) when i? = 0. We then increase the robustness limit R. Figure 9(A) shows that 
accuracy for both the discrete time model itself (dashed line), and its approximation up to 0{R^) 
(dot-dashed), barely decrease at all while R is less than 0.5, and then begin to fall off. This is 
consistent with the results for the full model in Fig 7(A). However, the discrete time model does 
predict a small decrease in accuracy at i? = 0.85 that is not seen in the full model. In the next 
section, we explain how this discrepancy can be resolved. 



3.2.2 Controlled duration task: Continuous time analysis 

We next extend the analysis of the controlled duration task in the previous section to signal inte- 
gration in continuous time. In brief, we follow a method developed in Gillespie (1996) to describe 
the evolution of the mean and variance of a continuous input signal that has been integrated over 
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time. This is challenging and interesting because, as for the signals used in modeling the random 
dots task above (see Methods, Sensory Input), this input signal contains temporal correlations. As 
in the previous section, we describe the distribution of the integrated signal at the final time T, 
which determines accuracy in the controlled duration task. 

We first replace the discrete input samples Zi from the previous section with a continuous signal 
Z{i\ which we take to be a (OU) gaussian process with a correlation timescale derived from our 
model sensory neurons (see Methods). We define the integrated process 



dE 
~dt 



Z{t) E{t) = f Z{t')dt' 
Jo 



with initial condition E{0) = 0. 

Assuming that Z(t) satisfies certain technical conditions that are easily verified for the OU pro- 
cess (wide-sense stationarity, a-stability, and continuity of sample paths (Gardiner, 2002; Billings- 
ley, 1986; Gillespie, 1996)), we can construct differential equations for the first and second moments 
{E{t)) and (£'^(t)) evolving in time. We start by taking averages on both sides of our definition of 
E{t)^ and, noting that E{0) = 0, compute the time- varying mean: 

= {Z{t)) =^ {E{t)) = t {Z{t)) . (10) 
Similarly, we can derive a differential equation for the second moment of E{t): 

The righthand side of this equation can be related to the area under the autocovariance function 
A (r) = {Z{t)Z{t + r)) - {Z{t)f of the process Z{t): 

{Z{t)E{t)) ^ (z{t) Z{s)dsJ = j^{Z{t)Z{s))ds 

= [ {Z{t)Z{t - t)) dT 
Jo 

= f A{T) + {Z{t)fdT 

Jo 

We now have an expression for how the second moment evolves in time. We can simplify the result 
via integration by parts: 

pt ps pt 
{E^{t)) = 2 A{T) + {Z{t)fdTds = 2 {t-T)A{T)dT + f{Z{t)f 

Jo Jo Jo 
=^ Var[£;(t)]=2 / {t-T)A{T)dT. (11) 

Because E{t) is an accumulation of gaussian random samples Z{t)^ it will also be normally dis- 
tributed, and hence fully described by the mean (Equation (10)) and variance (Equation (11)) (Billings- 
ley, 1986). 

To model a non-robust integrator, as discussed above we take Z{t) to be a OU process with 
steady-state mean and variance fi and a^, and time constant r. For the robust case, we can 
follow Equation 5 and parameterize a family of processes Zf^{t) with momentary values below 
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the robustness limit R set to zero. (Here, we again normalize the robustness limit by the standard 
deviation of the OU process.) We numerically compute the autocovariance functions (r) of these 
processes, and use the result to compute the required mean and variance, and hence time-dependent 
signal-to-noise ratio SNR(t), for the integrated process E{t). This yields 

SMR,(t) = -Jm^ = , . (12) 

Under the assumption that E{T) is approximately gaussian for sufficiently long T (which can be 
verified numerically), we use this SNR to compute decision accuracy at T: 

Acc^{T) ^ "--^^ L . (13) 

This function is plotted for T = 500 ms as the solid line in Figure 9(A). The plot shows that 
accuracy remains relatively constant until the robustness limit R exceeds ^ 0.85. Interestingly, this 
is a longer range of R values than for the discrete time case (compare dotted line in Figure 9(A)), 
and is closer to the results for the full model pictured in Figure 7(A). 

Why does the robustness limit appear to have a milder effect on degrading decision accuracy 
for our continuous vs. discrete time input signals? We can get some insight into the answer by 
examining the autocovariance functions Af^ (r), which we present in Figure 9(B). When normalized 
by their peak value, the autocovariance for R > 0.5 falls off more quickly vs. the time lag r (see inset 
in Figure 9(B)), indicating that subsequent samples become less correlated in time. Thus, there 
are effectively more "independent" samples that are drawn over a given time range T, improving 
the fidelity of the signal. This effect is not present in our discrete time model. 

Summary: Our analysis of decision performance for the controlled duration task shows that two 
factors contribute to the preservation of decision performance for robust integrators. The first is 
that, for robustness limits up to ^ 0.5, the momentary SNR of the inputs is barely changed by 
setting values below robustness limit to zero. The second is that, as R increases, the signal Z^{t) 
being integrated becomes less correlated in time. This means that (roughly) more independent 
samples will arrive over a given time period. 



3.2.3 Reaction time task: Discrete analysis 

We begin our analysis of the reaction time task by introducing a discrete time, discrete space 
random walk model. In this model, schematized in Figure 10(A) with five intermediate states, a 
particle representing the accumulated value E starts at a state balanced between two absorbing 
"sink" states. At every time step, the particle moves towards the "correct" (i.e. preferred) sink 
with probability p(l — i?), and towards the "incorrect" (null) sink with probability {l—p){l — R) (we 
consider p > 0.5, biasing the random walk toward the "correct" sink). There is also the possibility 
that the particle might remain in the current state, with probability R. 

We now draw an analogy between the states in this random walk model and the ratcheting 
dynamics among energy wells in a robust integrator (see Figure 4 and Introduction). Here, the po- 
sition of the particle represents the accumulated evidence for the left vs. right alternatives, and the 
absorbing states represent crossing of the corresponding decision thresholds. When the robustness 
limit R is increased, the wells - each of which could represent a bistable neural subpopulation (see 
Methods) - act to hold the particle in a given state, with a probability set by R. 
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Figure 10: Analysis of the effect of the robustness hmit in the reaction time task. (A) Biased 
random walk between two absorbing boundaries. The correct and incorrect states act as "sinks" 
of the discrete time discrete space Markov chain. The intermediate states are analogous to the 
potential wells in the robust integrator model. Here, a particle will remain in the current state 
at the next time step with probability R. The final probability of ending up in either sink is 
independent of R. (B) Speed accuracy tradeoff curves from Figure 7 are replotted and labeled (c), 
while a new line labeled (d) shows the performance predicted by the discrete time, continuous space 
model described in "Reaction time task: Continuous analysis". Here the signal-to- noise ratio of 
the discrete increments were chosen so that the line generated in the i? = case would overlay the 
line from the i? = continuous model, and is therefore not plotted. We see that the discrete time 
model over-predicts the impact of the robustness limit i?, just as in the controlled duration case 
(Figure 9). 
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As R is increased in the random walk model, the probability of transitioning out of a given state 
similarly decreases. Standard results on Markov chains (see, for example, Kemeny and Snell (I960)) 
provide formulas for the probability that the particle will end in one vs. the other sink, as well as 
the expected number of time steps until this occurs, based on the transition matrix associated with 
the random walk. The probability of ending in the "correct" sink corresponds to decision accuracy, 
and is found at the middle entry in the solution vector x of the matrix equation 

(I-Q)x=(l-i?)pei . (14) 

Here Q is a tridiagonal matrix with R on the main diagonal, p{l — R) on the lower diagonal, and 
{1 — p){l — R) on the upper diagonal; ei is the canonical basis vector with e^"^ = 1, and all other 
entries equal to 0. After some factoring, we find a common factor of (1 — R) on both sides of the 
equation; thus the solution to x is independent of R. This implies that the probability of ending 
up in the correct state is unchanged by increasing R from the non-robust case (i? = 0). Intuitively 
this makes sense: if one conditions on the fact that one will leave the current state on the next 
time step, the probability of moving toward the correct and incorrect states are independent of R. 

The same is not true for the expected number of steps necessary to reach a sink (by analogy, 
the reaction time). This is because the matrix system that yields reaction times is: 

(I-Q)t = l. (15) 

Here the right-hand side of this equation is the vector of all ones, and therefore no equivalent 
cancellation can occur. However, we do notice that the reaction time with i? ^ is just a rescaling 
of the original reaction time with i? = 0. Specifically, if tji is the expected number of steps required 
to reach an absorbing state, then 

Thus, the only effect of the robustness limit R is to delay arrival at the sinks. 

Summary: We have used a simplified random walk model to gain intuition about the effect of the 
robustness limit in the reaction time task, and to show that adding a robustness limit only affects 
decision latency, but not accuracy. In the next section, we will derive a similar result for continuous 
sample distributions. 



3.2.4 Reaction time task: Continuous analysis 

We return to the continuous sampling distribution introduced in "Controlled duration task: Dis- 
crete time analysis" , but now in the context of threshold crossing in the reaction time task. The 
accumulation of these increments toward decision thresholds can be understood as the sequen- 
tial probability ratio test, where the log-odds for each alternative are summed until a predefined 
threshold is reached (Wald, 1945; Gold and Shadlen, 2002; Luce, 1963; Laming, 1968). Wald (1944) 
provides an elegant method of computing decision accuracy and speed (RT). The key quantity is 
given by the moment generating function (MGF, denoted Mz{s) and defined in Equation 19) for the 
samples Z (see Luce (1986) and Doya (2007), Chapter 10). Under the assumption that thresholds 
are crossed with minimal overshoot, we have the following expressions: 



Acc = 



RT = 



1 + e^^o 



E[Z] 



tanh 



(17) 
(18) 
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where ho is one of the two real roots of the equation Mz{s) = 1 (the other root is precisely 1) and 
9 is the decision threshold. 

We first consider the case of a non-robust integrator, for which the samples Z are again normally 
distributed. In this case, we must solve the following equation to find s = ho: 

fz{z)e'*'dz = e—+'^' = l. (19) 

-OO 

It follows that 5 = 1 and s = ho = —2^ provide two real solutions of this equation. (Wald's 
Lemma ensures that there are exactly two such real roots, for any sampling distribution meeting 
easily satisfied technical criteria.) 

When the robustness limit i? > 0, we can again compute the two real roots of the associated 
MGF. Here, we use the increment distribution fz^{Z) given by Equation (8), for which all proba- 
bility mass within of is reassigned precisely to 0. Surprisingly, upon plugging this distribution 
into the expression Mz{s) = 1, we find that 5 = 1, /iq continue to provide the two real solutions to 
this equation regardless of R, as depicted in Figure 8(B). 

This observation implies that (1) accuracies (Equation (17)) are unchanged as R is increased, 
and (2) reaction times (Equation (18)) only change when E[Zji\ changes. In other words, the inte- 
grator can ignore inputs below an arbitrary robustness limit at no cost to accuracy, and a penalty 
in terms of reaction time will only be observed when E[Zr] changes appreciably. Generalizing our 
result, we note that a sufficient condition for ho to be unchanged as R changes is that the original 
sampling distribution fz{z) obeys 

fz{z) = fz{-z)e^°'; (20) 

it is straightforward to verify that the Gaussian satisfies this property. 

We next determine the magnitude of R necessary to change E[Zji\. When we substitute R = ^ 
and compute the perturbation to E[Zji], we again find only one term up to fifth order in R: 

/oo 
zfz^iz) dz = E[Z]{l-P) (21) 
-OO 

P=./|^e-K^)'i23 + o(i?5) 
V Ott 

This outcome is similar to the controlled duration case: small values of R will have little effect on 
E[Zr\, and therefore little effect on increasing decision speed (via Equation (18)). Moreover, as we 
have already shown, accuracy is unaffected by robustness limits R of any value. As a consequence 
we expect speed accuracy curves to change only modestly for small values of R. We illustrate 
this via a speed accuracy plot in Figure 10(B). Here, the present discrete time, continuous space 
model produces the chain-dotted curve (marked (d)), showing a moderate decrease in performance 
at ^ = 0.85. This decrease is purely due to the increase in RT just discussed. 

However, the model at hand does not reproduce the speed accuracy curve for the continuous 
time model shown in Figure 10(B). Indeed, the continuous time model produces better performance 
(higher accuracy at a given speed). This suggests an additional effect in the continuous time case: 
once again, the fact that R reduces autocorrelation of the integrated signal increases the fidelity of 
the input, improving performance (see inset in Figure 9(B)). Unlike the simpler controlled duration 
task, attempting a mathematical analysis of this effect is beyond the scope of this paper. 

Summary of analysis: We pause to summarize our analysis of how the robustness limit R impacts 
decision performance. For both the controlled duration and reaction time tasks, we first studied 
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Figure 11: Using the Reward Rate metric to quantify recovery of decision performance as the 
robustness hmit is increased, C — 12.8. (A) Speed accuracy curves plotted for multiple values 
of i?; as in previous figures, the greater accuracies found at fixed reaction times indicate that 
performance improves as R increases. The heavy line indicates the "baseline" case of a perfectly 
tuned, non-robust integrator (repeated from Figure 5(B)). RR isoclines are plotted in background 
(dotted lines; see text), and points along speed accuracy curves that maximize RR are shown as 
circles. These maximal values of reward rate are plotted in (B), demonstrating the non-monotonic 
relationship between R and the best achievable RR. 



the effect of this limit on the evidence carried by momentary values of sensory inputs. In each task, 
this effect was more favorable than might have been expected: in the controlled duration case, the 
signal to noise ratio of momentary inputs was preserved for a fairly broad range of i?, while in 
the reaction time task, R was shown to affect accuracy but not speed at fixed decision threshold. 
Moreover, the robustness mechanism serves to decorrelate input signals in time. This contributes 
further to decision performance being preserved as the robustness limit increases. 

3.3 Reward rate and the robustness-sensitivity tradeoff 

Until now, we have examined performance in the reaction time task by plotting the full range 
of attainable speed and accuracy values. The advantage of this approach is that it demonstrates 
decision performance in a general way. An alternative, more compact approach, is to assume a 
specific method of combining speed and accuracy into a single performance metric. This approach 
is useful in quantifying decision performance, and rapidly comparing a wide range of models. 
Specifically, we use the reward rate (RR) (Gold and Shadlen, 2002; Bogacz et al., 2006): 

FC 

RR = ——^-^ . (22) 

Reward rate can be thought of as the number of correct responses made per unit time, with 
a delay T^ei imposed between responses to penalize rapid guessing. Implicitly, this assumes a 
motivation on the part of the subject which may not be true; in general, subjects rarely achieve 
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Figure 12: A nonzero robustness limit improves performance across a range of mistuning baises 
^. In both the reaction time (A) and controlled duration (B) tasks, robustness helps improve 
performance when (3 ^ .1^), for all values of ^ shown. Here, as in previous panels, the coherence 
of the sensory input \s C — 12.8. In the reaction time task, 9 is varied for each value of ^ to find 
the maximal possible reward rate RR (see text); T^ei — 3 seconds. In the controlled duration case, 
9 = 15.3 is fixed, in agreement with a value indicated by behavioral data (see below and Figure 14). 



optimality under this definition as they tend to favor accuracy over speed in two-alternative forced 
choice trials (Zacksenhouse et al., 2010). Here, we simply use this quantity to formulate a scalar 
performance metric that provides a clear, compact interpretation of reaction time data. 

Plotted in Figure 11(A) are multiple accuracy vs. speed curves. The heavy solid line corresponds 
to the "baseline" model with robustness and mistuning set to zero (see Figure 4). The lighter solid 
line corresponds to the "mistuned" model with cr^ = .1. The remaining dashed lines correspond to 
the "recovery" model for three different, nonzero levels of the robustness limit R. Also plotted in 
the background as dashed lines are RR isoclines - that is, lines along which RR takes a constant 
value, with T^ei — 3 sec. On each accuracy vs. speed curve, there exists a RR-maximizing (RT, 
accuracy) pair. This corresponds to a tangency with one RR isocline, and is plotted as a filled 
circle. In general, each model achieves maximal RR via a different threshold 9] values are specified 
in the legend of Panel (B). (A general treatment of RR-maximizing thresholds for drift-diffusion 
models is given in Bogacz et al. (2006).) 

In sum, we see that mistuned integrators with a range of increasing robustness limits R achieve 
greater RR, as long as their thresholds are adjusted in concert. The optimal values of RR for a range 
of robustness limits R are plotted in Figure 11(B). This figure illustrates the fundamental tradeoff 
between robustness and sensitivity discussed above. If there is variability in feedback mistuning 
((j/3 > 0), increasing R can help recover performance. However, beyond at a certain point increasing 
R further starts to diminish performance, as too much of the input signal is ignored. 

3.4 Biased mistuning towards leak or excitation 

We next consider the possibility that variation in mistuning from trial to trial could occur with a 
systematic bias in favor of either leak or excitation, and ask whether the robustness limit has quali- 
tatively similar effects on decision performance as for the unbiased case studied above. Specifically, 
we draw the mistuning parameter /3 from a gaussian distribution with standard deviation = 0.1 
as above, but with various mean values ^ (see Methods). In Figure 12(A) we show reward rates as 
a function of the bias ^5, for several different levels of the robustness limit R. At each value of ^5, the 
highest reward rate is achieved for a value of i? > 0; that is, regardless of the mistuning bias, there 
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Figure 13: Effect of the robustness limit R on decision performance in a controlled duration task, 
under the bounded integration model of Kiani et al (2008) (see text). Dot coherence C = 12.8. 
(A) Increasing the robustness limit R helps recover performance lost to mistuning at multiple 
reaction times in the controlled duration task. Specifically, moving from the "baseline" model 
to the "mistuned" model decreases decision accuracy (solid arrow), but this lost accuracy can be 
partially or fully recovered for i? > (dotted arrow). (B) When allowing for biased mistuning 
(;5 7^ 0, a = .1), still allows for recovery of performance; effects are most pronounced when 

exists a > that will improve performance vs. the non-robust case (i? = 0). We note that this 
improvement appears minimal for substantially negative mistuning biases, but is more noticable 
for the values of P that yield the highest RR. Finally, the ordering of the curves in Figure 12(A) 
shows that, for many values of ^5, this optimal robustness limit is an intermediate value less than 
R = 2. 

While Figure 12 only assesses performance via a particular performance metric (RR, T^ei = 3 
sec), the analysis in "Reward rate and the robustness-sensitivity tradeoff' suggests that the result 
will hold for other performance metrics as well. Moreover, Figure 12(B) demonstrates the analogous 
effect for the controlled duration task: for each mistuning bias ^5, decision accuracy increases over 
the range of robustness limits shown. 

3.5 Bounded integration as a model of the fixed duration task 

We have demonstrated that increasing the robustness limit R can improve performance for mistuned 
integrators, in both the reaction time and controlled duration tasks. In the latter, a decision is made 
by examining which integrator had accumulated more evidence at the end of the time interval. In 
contrast, Kiani et al. (2008) argue that decisions in the controlled duration task are actually made 
with a decision threshold (or bound). That is, evidence accumulates toward a bound as in the 
reaction time task; if accumulated evidence crosses the bound before the end of the task duration, 
the subject simply waits for the opportunity to report the choice, ignoring any further evidence. 

Figure 13 demonstrates that our observations about the how the robustness limit can recover 
performance lost to mistuned feedback carry over to this model of decision making as well. Specif- 
ically, Panel 13(A) shows how setting R > improves performance in a mistuned integrator. In 
fact, more of the lost performance (up to 100%) is recovered than in the previous model of the 
controlled duration task (cf. Figure 6(A)). Panel 13(B) extends this result to show that some value 
of i? > will recovers lost performance over a wide range of mistuning biases ^ (cf. Figure 12(B)). 
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3.6 Compatibility of the robust integrator model with behavioral data 

Given the fact that the robustness property can improve decision performance in our model, we 
next ask whether robust hmits R > are compatible with known behavioral data. To answer this 
question, we fit accuracy and chronometric functions from robust integrator models to behavioral 
data of Roitman and Shadlen (2002) in the reaction time task. This fit is via least squares across 
the range of coherence values, and requires two free parameters: additive noise variance (see 
Methods) and the decision bound 9. 

Figure 14 shows the results. Panels (A) and (B) display accuracy and chronometric data (dots) 
together with fits for various integrator models. First, the solid line gives the fit for the "baseline" 
model. The close match between model and data agrees with findings of prior studies (Mazurek et 
al., 2003). Next, the dashed and dotted lines give fits for mistuned models {a^ = 0.1), with three 
values of bias in feedback mistuning (P). To obtain these fits, both z/^ and 9 are changed from 
their values for the baseline case. In particular, the noise variance z/^ is lowered when feedback is 
mistuned. This makes intuitive sense: we have seen in Figure 5 that mistuned feedback worsens 
performance for a given signal, so that matching a fixed dataset with a mistuned integrator requires 
improving the fidelity of the incoming signal. 

Figures 14(C), (D) show analogous results for robust integrators. For all cases in these panels, 
we take the robustness limit R = 1.15. We fix levels of additive noise to values found for the 
non-robust case above, on order to demonstrate that by adjusting the decision threshold, one can 
obtain approximate fits to the same data. This is expected from our results above: Figure 6 
shows that, while accuracies at given reaction times are higher for mistuned robust vs. non-robust 
models, the effect is modest on the scale of the full range of values traced over an accuracy curve. 
Moreover, for the perfectly tuned case, accuracies at given reaction times are very similar for 
robust and non-robust integrators (Figure 7, with a slightly lower value of R). Thus, comparable 
pairs of accuracy and RT values are achieved for robust and non-robust models, leading to similar 
matches with data. In sum, the accuracy and chronometric functions in Figure 14 show that 
all of the models schematized in Figure 4 — "baseline", "mistuned", "robust", and "recovery" — 
are generally compatible with the chronometric and accuracy functions reported in Roitman and 
Shadlen (2002). 

In order to further test whether empirical data are consistent with the robust integrator model, 
we compared simulated reaction time histograms with those found in Roitman and Shadlen (2002). 
First, Figure 15 compares the reaction time histograms resulting from the "baseline" model (Panel 
A) and the "recovery" model (Panel B). These are plotted in blue; the red histograms are data 
are taken from Subject "B". In both panels, the histograms have similar means, but differ in 
their shape; in particular, the model predicts a broader range of reaction times and a more slowly 
decaying tail of the RT distribution. From these data, we conclude that neither the "baseline" nor 
the "recovery" model quantitatively reproduce the details of reaction time distributions, when the 
free parameters 9 and are constrained by fitting the accuracy and chronometric functions. 

An urgency signal was introduced in Churchland et al. (2008) to better capture behavioral and 
physiological data. We next incorporated such a signal into our model to determine whether it 
would better align our predicted reaction time histograms with the empirical data. We chose to 
implement urgency by assuming a collapsing decision bound, which decreases monotonically from 
a peak value of to a steady state value 9ss with a halflife 

e{t) = eo-{eo-0ss)T^- (23) 

t + 1 1 

2 



22 



(A) (B) 




Dot Coherence (C) Dot Coherence (C) 







Not Robust {R = 0) 


Robust {R= 1.15) 


/? 


{e, ^) 


Perfect Tuning 


~ 5(0) 




(15.3, 14.0) 


(7.6, 14.0) 


Mistuning 


~ A^(0,.l2) 

~ (-.05,.l2) 
~ A^(.05,.l2) 




(11.0, 12.0) 
(9.2,12.1) 
(13.8, 11.5) 


(6.8,12.0) 
(5.7,12.1) 
(8.6,11.5) 



Figure 14: Accuracy (A,C) and chronometric (B,D) functions: data and model predictions. Solid 
dots are behavioral data for rhesus monkeys performing the dot-motion discrimination task (Roit- 
man and Shadlen, 2002). In each panel, the accuracy and chronometric functions are identified 
with behavioral data via a least-squares fitting procedure over the free parameters 6 and v^. In 
Panels (A,B), the robustness threshold = 0, and results are shown for "baseline" and exemplar 
"mistuned" models (see legend in table). In Panels (C,D), R = 1.15, and results are shown for 
"robust" and "recovery" models. The close matches to data points indicate that each model can 
be broadly reconciled with known psychophysics. Parameter values for each curve are summarized 
in the included table. 
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Figure 15: Reaction time histograms, with decision bounds held constant in time and dot coherence 
C=12.8. Histograms for reaction times for our model with both i? = (A) and R = 1.15 (B) 
are plotted in blue. Overlayed are the reaction time histograms for one subject (Subject "B") 
in (Roitman and Shadlen, 2002), in red (semitransparent). Box-and- whisker plots indicate the 
quartiles for each data set. Both histograms have identical means, but clearly differ in basic shape 
(i.e., the model produces longer tails). We emphasize that this mismatch is a property of our basic 
integration to bound model, regardless of the value of the robustness limit R (see text). 
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Figure 16: Reaction time histograms, with collapsing decision bounds. This shows that introducing 
collapsing bounds produces simulated RT histograms that are a closer match to data, for both non- 
robust (Panel A, ^ = 0) and robust (Panel B, ^ = 1.15) integrators. This shows that RT histograms 
can be similarly well matched to data for both cases. Here ti = 500, — 25, and 0ss = 0. 
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Figure 16 compares model reaction time histograms produced with the cohapsing bound against the 
data, and indeed finds a closer fit: qualitatively, the improvement in fit is similar for both the non- 
robust {R = 0) and robust {R = 1.15) cases. In sum, this shows that the robust integrator model 
is capable of producing roughly similar patterns of reaction times compared with those observed 
experimentally. 

4 Discussion 

A wide range of cognitive functions require the brain to process information over time scales that 
are at least an order of magnitude greater than values supported by membrane time constants, 
synaptic integration, and the like. Integration of evidence in time, as occurs in simple perceptual 
decisions, is one such well studied example, whereby evidence bearing on one or another alternative 
is gradually accumulated over time. This is formally modeled as a bounded random walk or drift- 
diffusion process in which the state (or decision) variable is the accumulated evidence for one 
choice and against the alternative(s). Such formal models explain both the speed and accuracy of 
a variety of decision-making tasks studied in both humans and nonhuman primates (Ratcliff, 1978; 
Luce, 1986; Gold and Shadlen, 2007; Palmer et al., 2005), and neural correlates have been identified 
in the firing rates of neurons in the parietal and prefrontal association cortex (Mazurek et al., 2003; 
Gold and Shadlen, 2007; Churchland et al., 2008; Shadlen and Newsome, 1996; Schall, 2001; Shadlen 
and Newsome, 2001; Kim et al., 2008). The obvious implication is that neurons must somehow 
integrate evidence supplied by the visual cortex, but there is mystery as to how. 

The reason this is a challenging problem is that the biological building blocks operate on rel- 
atively short time scales. From a broad perspective, the challenge is to assemble neural circuits 
that that can sustain a stable level of activity (i.e., firing rate) and yet retain the capability to 
increase or decrease firing rate when perturbed with new input (e.g., momentary evidence). A well 
known solution is to suppose that recurrent excitation might balance perfectly the decay modes 
of membranes and synapses (Cannon et al., 1983; Usher and McClelland, 2001). However, this 
balance must be fine tuned (Seung, 1996; Seung et al., 2000), or else the signal will either dis- 
sipate or grow exponentially (Figure (A), top). Several investigators have proposed biologically 
plausible mechanisms that mitigate somewhat the need for such fine tuning (Lisman et al., 1998; 
Goldman et al., 2003; Goldman, 2009; Romo et al., 2003; Miller and Wang, 2006; Koulakov et al., 
2002). These are important theoretical advances because they link basic neural mechanism to an 
important element of cognition and thus provide grist for experiment. 

Although they differ in important details, many of the proposed mechanisms can be depicted 
as if operating on a scalloped energy landscape with relatively stable (low energy) values, which 
are robust to noise and mistuning in that they require some activation energy to move the system 
to a larger or smaller value (Figure (A), bottom; cf. (Pouget and Latham, 2002)). The energy 
landscape is a convenient way to view such mechanisms - which we refer to as robust integrators 
- because it also draws attention to a potential cost. The very same effect that renders a location 
on the landscape stable also implies that the mechanism must ignore information in the incoming 
signal (i.e., evidence). Here, we have attempted to quantify the costs inherent in this loss. How 
much loss is tolerable before the circuit misses substantial information in the input? How much 
loss is consistent with known behavior and physiology? 

We focused our analyses on a particular well-studied task because it offers critical benchmarks to 
assess both the potential costs of robustness to behavior and a gauge of the degree of robustness that 
might be required to mimic neurophysiological recordings with neural network models. Moreover, 
we know key statistical properties of the signal and noise to be accumulated over time, based on 
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firing properties in area MT. 

Our central finding is that ignoring a surprisingly large part of the motion evidence would have 
almost negligible impact on performance. Indeed, we found that speed and accuracy are preserved 
even when almost a full standard deviation of the input distribution is ignored. We also found that 
a similar degree of robustness provides protection of performance against mistuning of recurrent 
excitation. Although in general this protection is only partial (Figure 6) , for the controlled duration 
task it can be nearly complete (Figure 13(A), T > 2) depending on the presence of a decision bound. 

We can appreciate the impact of robust integration intuitively by considering the distribution 
of random values that would increment the stochastic process of integrated evidence. Instead of 
imagining a scalloped energy surface, we simply replace all the small perturbations in integrated 
evidence with zeros. Put simply, if a standard integrator would undergo a small step in the positive 
or negative direction, a robust integrator instead stays exactly where it was. In the setting of 
drift-diffusion, this is like removing a portion of the distribution of momentary evidence (the part 
that lies symmetrically about zero) and replacing the mass with a delta function at 0. At first 
glance this appears to be a dramatic effect - see the illustration of the distributions in Figure 8 - 
and it is surprising that it would not result in strong changes in accuracy or reaction time or both. 

Three factors appear to mitigate this loss of momentary evidence. First, we showed that setting 
weak values of the input signal to zero can reduce both its mean and its standard deviation by a 
similar amount, creating compensatory effects that result in a small change to the input signal- 
to- noise ratio. Second, we showed that, surprisingly, the small loss of signal to noise that does 
occur would not result in any loss of accuracy if the accumulation were to the same bound as 
for a standard integrator. The cost would be to decision time, but mainly in the regime that is 
dominated by drift - that is, the shorter decision times - hence not a large cost overall. Third, even 
this slowing is mitigated by the temporal dynamics of the input. Unlike for idealized drift diffusion 
processes, real input streams possess finite temporal correlation. Left unchecked, this would imply 
greater variability in the integrated signal. Interestingly, removing the weakest momentary inputs 
reduces the temporal correlation of the noise component of the input stream. This can be thought 
of as allowing more independent samples in a given time period, thereby improving accuracy at a 
given response time. 

Our robust integrator framework shares features with existing models in sensory discrimination. 
The interval of uncertainty model of Smith and Vickers (1989) and the gating model of Purcell et 
al. (2010) ignore part of the incoming evidence stream, yet they can explain both behavioral and 
neural data. We suspect that the analyses developed here might also reveal favorable properties 
of these models. Notably, some early theories of signal detection also featured a threshold, below 
which weaker inputs fail to be registered - so called high threshold theory (reviewed in (Swets, 
1961)). The primary difference in the current work is to consider single decisions made based on 
an accumulation of many such thresholded samples (or a continuous stream of them). 

Although they are presented at a general level, our analyses make testable predictions. For 
example, they predict that pulses of motion evidence added to random dot stimulus would affect 
decisions in a nonlinear fashion consistent with a soft threshold. Such pulses are known to affect 
decisions in a manner consistent with bounded drift diffusion (Huk and Shadlen, 2005) and its 
implementation in a recurrent network (Wong et al., 2007). A robust integration mechanism further 
predicts that brief, stronger pulses will have greater impact on decision accuracy than longer, weaker 
pulses containing the same total evidence. 

However, we believe that the most exciting application of our findings will be to cases in 
which the strength of evidence changes over time, as expected in almost any natural setting. One 
simple example is for task stimuli that have an unpredictable onset time, and whose onset is 
not immediately obvious. For example, in the moving dots task, this would correspond to subtle 
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increases in coherence from a baseline of zero coherence. Our preHminary calculations agree with 
intuition that robust integrator mechanism will improve performance: in the period before the 
onset of coherence, less baseline noise would be accumulated; after the onset of coherence, the 
present results suggest that inputs will be processed with minimal loss to decision performance - 
despite the continued ignoring of weak components. This intuition can be generalized to apply to 
a number of settings with non- stationary sensory streams. 

Many cognitive functions evolve over time scales that are much longer than the perceptual 
decisions we consider in this paper. Although we have focused on neural integration, it seems 
likely that many other neural mechanisms are also prone to drift and instability. Hence, the need 
for robustness may be more general. Yet, it is difficult to see how any mechanism can achieve 
robustness without ignoring information. If so, our finding may provide some optimism. Although 
we would not propose that ignorance is bliss, in the right measure it may be less costly than one 
would expect. 
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1 Derivation of effective model 



Here we we describe a family of neural circuit models, parameterized by a robustness value i2, that 
produce dynamics similar to that of the central robust integrator model of the main paper (given by 
Equation 5 in the main text). In particular, as i? ^ 0, the dynamics reduce to a perfect integrator. 
Our construction follows closely that of Koulakov et al. (2002) and Goldman et al. (2003). 

The circuits that we will study derive their robustness from multistability, which follows from 
recurrent self-excitation among multiple subpopulations (or subunits). We begin with a system of 
differential equations, one for each subunit. Depending on the activity in the rest of the network, 
each individual subunit can be become bistable, so that its eventual steady-state value (i.e., "On" 
or "Off') depends on its past. This effect, known as hysteresis, underlies several robust integrator 
models (Koulakov et al., 2002; Goldman et al., 2003). 

The circuit integrates inputs via sequential activation of subunits, in an order determined by 
graded levels of "background" inputs (or biases) to each subunit. Following Goldman et al. (2003), 
we collapse the N differential equations that describe individual subunits into a equation that 
approximates the dynamics of the entire integrator. This expression for the total firing rate E(t) 
averaged over all subpopulations reduces to the robust integrator model of the main text (Equation 
5 in the main text). 

1.1 Firing rate model 

The firing rate r^(t) of the ith bistable subunit {i G {1,2,..., N}) is modeled by a firing rate equation: 

TE^ = 'r, + f{r,) (1) 



N 

p^ri + q{l + ^)^rj + aAI - hi 



f{n)=r- + {r^ -r-)H _ 

The parameters and variables in this equation are as follows: 

• ri{t): Firing rate of subunit (or pool) 

• aAI{t) : Input signal, with synaptic weight 

• p: Within-pool "synaptic" weight 

• q: Between-pool "synaptic" weight 

• /3: Fractional mistuning of feedback connectivity 

• N: Number of subunits in the integrator network 

• r~: Minimum firing rate of a subunit 

• r~^: Maximum firing rate of a subunit 

• te ' Time constant governing firing rate dynamics 

We also define: 

• E{t) — X^^i ri{t): Overall integrator activity 
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Figure 1: Simultaneous plots of the identity line and the "feedback line," G{E) for two circuits with 
different numbers of subunits. (A) Here = 1, and so the feedback line G{E) — f{E) is exactly 
determined as a function of £^ = ri (see Equation (3)). The two intersections cq and ci are stable 
fixed points. In this way, the subunit's firing rate is bistable, and the value attained will depend on 
the history of the circuit activity. As AI is changed, this translates f{E), eventually eliminating 
either Cq or ci and forcing the subunit to the remaining stable fixed point (here, cq corresponds to 
E = 8 Hz. and ci to 40 Hz). In dashed curves, the feedback line is plotted for two such values 
of AI. (B) Now A^ > 1 and so the feedback line G{E) is no longer unambiguously specified as 
a function of its argument. The function is instead the sum of A^ potentially bi- valued functions, 
who's actual values will depend on the stimulus history. We represent this fact by plotting the 
feedback line as a set of stacked boxes, representing the potential contribution of the i^^ subunit to 
the total integrator dynamics (Goldman et al., 2003). 



Figure 1(A) demonstrates the firing rate dynamics of a circuit composed of a single subunit 
(A^ = 1). Here we refer to Equation (1) and plot two curves vs. E — ri. The first is the identity line, 
corresponding to the first "decay" term in Equation (1). The second is the "feedback" line, defined 
by the second term in Equation (1). Since A^ = 1, this simplifies to f{ri). The two intersections 
marked cq and ci are stable fixed points (which we refer to as "On" and "Off' respectively). Thus, 
the subunit shown is bistable. Importantly, however, the location of the step in f{E) varies with 
changes in the input signal (as per Equation 1). In particular, substantial values of AI{t) will 
(perhaps transiently) eliminate one of the fixed points, forcing the subunit into either the "On" or 
the "Off' state with ri — r+ or = r_, respectively. Moreover, the change is self-reinforcing via 
the recurrent excitation pri. The range over which a given subunit displays bistability is affected by 
the mistuning parameter /3, which scales the total recurrent excitation from the rest of the circuit. 

We now derive the dynamics of the overall firing rate E{t). After summing both sides of Equation 
(1) over i we have: 

rE^ = -E + G{E) (2) 

where 

G{E) = r- + l!L__!li J2h[{p- q{l + /3))r. + Nq{l + ^)E + aAI - h] . (3) 

i=l 

At this point, we almost have a differential equation for a single variable, E{t). However, Equa- 
tion (3) still depends on the A^ activities of the individual subunits, and at any particular time 
their values are not uniquely determined by the value of E; we can only bound their values as 
r~ < Ti < . We will return to discuss the dynamics of Equation 2 below. 
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Figure 2: Plot of possible equilibria for Equation 2. (A) The extent of each multivalued feedback 
function defines the minimum input necessary to perturb the system away from equilibrium, defining 
the "fixation" lines. As ^ oo, the stable fixed points become more tightly packed on the interval 
(r~, r+). (B) When the integrator is mistuned, the fixation lines and the feedback line are no longer 
parallel. The rate that the integrator accumulates input is approximated by the distance between 
the center line of the feedback subunits {G{E))^ and the feedback line. However, integration only 
occurs when the "fixation condition" is no longer satisfied, i.e. when the feedback line is no longer 
bounded by the fixation lines. 



1.2 Bias term 

The bias term for the i^^ subunit, 6^, is set by analyzing the range of values of E for which the 
exact value of the feedback function /(n) is unknown. In the case of an integrator composed of 
only a single subunit, we choose the bias term to cause the positive input needed to force the unit 
to be on, and the negative input needed to force the unit to be off, to take the same values. This 
yields hi = r>(r++r-) _ 

The general case of A^ subunits is more complicated. Now the feedback contribution of the i^^ 
unit, /(n), is no longer a simple function of the population activity E. Instead, it has additional 
dependence on its own activity r^. We see this clearly in Equation 3, where the values of ri that 
contribute to the definition of G{E) are unspecified. However, we do know that each ri is trapped 
between r+ and r~ . Therefore, we can plot G{E) as the sum of a sequence of bivalued functions of 

see Figure 1(B) and (Goldman et al., 2003). The contribution from each pool is then represented 
by the shaded region in the plot. Finally, the bias terms are chosen to center these shaded boxes 
over the identity line. The correct biases in this general case are: 

^ ^ + q{{i - l)r+ + (A^ - i)r-) . (4) 



1.3 Fixation lines 

We next define the "fixation" lines {G^{E) and G~{E)), which are the consequence of this multi- 
valued property of the integrator. These lines define a region (the fixation region) that runs across 
the outermost corners of the "stacked boxes" in Figure 2. 
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The term fixation region refers to the following property: if the input AI is such that the identity 
line lies within the fixation region, then the integrator will possess a range of closely spaced fixed 
points (where E = G[E)). Thus, E is not expected to change from its current value; in other 
words, integration of A/ will not occur (Goldman et al., 2003; Koulakov et al., 2002). Recall that 
A/ acts to shift these boxes leftward or rightward relative to the identity line, just as in the analysis 
of Figure 1. As a consequence, it is weak inputs that fail to be integrated. 

From this analysis, we can see that integration by the system as a whole relies on two con- 
cepts. The first is a condition on A/ necessary to eliminate fixed points; we call this the "fixation 
condition." The second is the question of how quickly to integrate once this condition is no longer 
satisfied. We address this next. 

1.4 Integration 

Based on the analysis above, we derive a reduced model that approximately captures the dynamics 
of the "full" model indicated by Equation 2. We call this the "effective" model. The rate of change 
oi E - i.e., the rate of integration - is given by the distance between the the current value of G{E) 
and the identity line. We approximate this by the distance between the middle of the fixation lines, 
which we define as G{E), and the identity line. This is pictured in Figure 2(B), and yields: 

dE 

TE— = -E + G{E)^-E + G{E) (7) 
G{E) = (l + p)E+ — -p ^ ' (8) 

We emphasize that integration by this equation only occurs when the "fixation" condition is no 
longer satisfied, i.e. when the fixation lines no longer bound the the identity line. 

1.5 Fixation condition 

The last step in defining the 1-dimensional "effective" model is determining the fixation condition. 
We must solve for the values of E that cause the feedback line to lie between the two fixation lines: 



No Change in E 



G-{E) <E<G^{E) 



G{E)- 



(r^ 



2Nq 
(r"*" + r 



'^<E<G{E)+^— 



2Nq 



2N 



< 



(r+ -r ){p- Pq) 
2Nq 



(9) 
(10) 

(11) 



If this condition is violated with AI = 0, the integrator displays runaway integration {(3 > 0) or 
leak (/3 < 0). If it is satisfied when AI = 0, we have a condition on the level of AI that must be 
present for integration to occur. This yields a piecewise-defined differential equation, corresponding 
to when integration can and cannot occur: 
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TE 



dE 
dt 







(3 



(r++r~) 



(r++r~) 



2N 



< 



(r+-r ){p-(3q) 
2Nq 



otherwise 



Nq ^ 2N 

We now simplify this equation as follows. First, we assume that p is much larger than ^q^ so 



that 



(r+-r ){p-(3q) ^ (r+-r )p 



2Nq 



2Nq 



= i?, where R is the robustness parameter in the main text. Next, we 



note that as increases, the additive term l^ ^^^2N ^ becomes negligible (here the synaptic weights 
a and p can adjusted so that the remaining terms do not vanish). Finally, we define = This 
yields the central Equation 5 of the main text: 



dE 



:\PE + kAI\ < R 

I3E + : otherwise 



(12) 



1.6 Quality of the model reduction 

Figure 3 compares the reduced "effective" model, defined by Equation 12 and denoted by the 
variable and the "full model" of multiple subunits, defined by Equation 1 and denoted by the 
variable E. Results are given for three different values of the normalized robustness limit R ^ are 
displayed, all in response to the same input signal A/(t) for ease of comparison. As R increases, 
the ability of the effective model to track the full model decreases. The quality of the reduction 
can be quantified by examining the relative error between the full and effective models: 

, E(t) -E(t) 

^ ^ E{t) ^ ^ 

Histograms of e(t) evaluated at t = 500 sec. are given in Figure 3(D-F). The agreement of the 
two models is within roughly 20% across a range of robustness values R. This is sufficient for 
our purposes of demonstrating an approximate connection between the simplified integrator model 
used in the main text and one of its many possible neural substrates. 



2 Additional indicators that robust integration is compatible with 
empirical data from decision tasks 

As emphasized in the main text, we make several assumptions about how evidence is integrated 
over time in our model of perceptual decision making. The first is that the tuning of connectivity in 
the integrator circuit is inherently imprecise from trial to trial. This imprecision in feedback tuning 
results in spurious activity, in the form of either exponential growth or decay. To counter this, we 
consider the presence of a robustness mechanism meant to ameliorate this problem by allowing the 
integrator to fixate at various levels of activity despite mistuning. 

These assumptions differ from, e.g., the drift-diffusion model for two-alternative decision mak- 
ing, which has been shown to explain psychophysical and physiological data. An important question 
is therefore whether the model at hand will also be able to capture these data. In the main text we 
find that the answer is yes, for behavioral data on decision accuracy and speed (Figures 14 and 16 
in the main text). We next expand this analysis by considering several other aspects of the data: 

^This is the robustness parameter divided by the steady-state standard deviation of the input signal; see main 
text. 
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t (ms) 6(500) 

Figure 3: Quality of the reduction from the "fuh" model to the "effective" model. For three different 
values of the robustness hmit traces E{t) for the full model (Equation 1), and its reduction E{t) 
(Equation 1.5) are compared (Panels A, B, and C). Here, the same signal is integrated by each 
model. Panels D, E, and F show histograms (N=100) of the relative error between the two models 
at t = 500 ms. (See Equation 13). The mean fi and standard deviation a for each distribution 
are also indicated. At these levels of the effective model gives an approximation of the full 
subpopulation-based model with low to moderate error (see text). 
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Figure 4: Reaction time histograms from Figures 15 and 16 of the main text, sorted by correct 
and incorrect responses and plotted in Panels (A) and (B), respectively. Correct trials are pictured 
above, while incorrect trials are below; settings as in the table included in Figure 14 of the main 
text. (A) Mean reaction time for error trials is 740 ms., vs. 671 ms. for correct trials; here a 
non-robust integrator is precisely tuned (a^ = 0, and i? = 0) (B) For a robust integrator with 
imprecise tuning, error trials are now shorter (644 vs. 652 ms.). 



error-trial vs. correct-trial reaction time histograms, controlled duration accuracy performance, 
decision-triggered stimulus predictions, and traces of neural firing rates vs. time. 

2.1 Asymmetry of reaction times between correct and incorrect trials 

Roitman and Shadlen (2002) and many other authors report differences in mean reaction times 
when responses are sorted by correct vs. incorrect responses. In Figure 4, we perform the same 
analysis on the data in Figure 14 of the main text. First, Panel (A) shows that our model of a 
perfectly tuned, non-robust integrator produces the shows the same qualitative asymmetry, with 
error trials on average having longer reaction times. With the addition of imprecise tuning and 
robustness, however this asymmetry can be eliminated (Panel (B)). 

Numerous changes to the model could recover the longer error RTs. One possibility is to 
introduce a bias in mistuning parameters: choosing ^ > has the desired effect. Others include 
inclusion of collapsing bounds, urgency signals, and other forms of trial-to-trial variability (for 
example Ratcliff and Rounder (1998)). 

2.2 Model consistency with controlled duration data 

In the main text, we showed that an integrator model with imprecise tuning and robustness can 
reproduce the chronometric and psychometric curves reported in Roitman and Shadlen (2002) for 
the free-response task. We also considered the consequences of both robustness and mistuning in 
a controlled duration task. Here we show that the same parameters (Table, Figure 14 of the main 
text), this model can reproduce behavioral data reported in Kiani et al. (2008) for the controlled 
duration task. 

We assume bounded integration, where an integration threshold determines which alternative 
is selected, and the subject waits until the end of the trial to report the selection. The comparison 
is demonstrated in Figure 5, where model the accuracy functions from the model are overlaid on 
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Figure 5: Comparison of model with controlled duration accuracy, C=12.8. The perfectly tuned 
model (A), and imprecise/robust model (B), both predict accuracy that is comparable with those 
reported in (Kiani et al., 2008) (Dotted hues, overlaid). In both cases, the CD task is modeled 
with bounded integration, with threshold set as in the table of Figure 14, main text. 

the data provided in the Kiani et al. study, with no parameter changes (including threshold) from 
the data reported in the fit to the Roitman et al. reaction time task data in the main text. 



2.3 Decision-triggered stimuli 

Kiani et al. (2008) also report the time-course of motion evidence leading to the eventual decision 
in the controlled duration task. Specifically, the authors study long duration trials with neutral 
stimuli (i.e., zero coherence, so neither alternative is "correct"). In our model, motion evidence is 
encoded by the stimulus A/(t) (see main text). In Figure 6 we sort 400 trials (coherence C=0) based 
on which alternative was selected. We average the stimuli, again assuming bounded integration in 
a controlled duration task. 

We observe that the results of our model, both perfectly tuned and imprecisely tuned with 
nonzero robustness, are consistent with those reported by Kiani et al. Specifically, the early separa- 
tion of motion energy profiles for rightward (red) and leftward (blue) choices are indicative bounded 
accumulator, and do not qualitatively change when mistuning and robustness are included in the 
model. 



2.4 Trial-sorted traces of firing rates vs. time 

Firing rates of LIP neurons reflect the time course of accumulated sensory evidence (Gold and 
Shadlen, 2007). We verified that our model displays analogous ramping activity when averaged 
over the similar number of trials. In particular, we checked this both for the perfectly tuned, 
non-robust version of our model (Figure 7(A-B)) and in the presence of mistuning and robustness 
(Figure 7(C-D)). 

Integrator activity averaged across multiple trials is averaged, and aligned to both the onset 
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Figure 6: Time course of decision-triggered stimuli. Stimulus realizations A/(t) are sorted by "left" 
vs "right" alternative selected (red vs. blue; neither alternative is "correct" as coherence C=0), and 
averaged across trials. Again assuming perfect tuning, (A), and imprecise tuning with robustness, 
(B), we find that both results are consistent with the motion energy profiles reported in Kiani et 
al. (2008). All integrator settings are the same as reported in the table in Figure 14 of the main 
text. 
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Figure 7: Time course of model integrator activity E{t), during stimulus integration in reaction 
time tasks. Integration of input signal at various coherence values, with responses aligned at the 
beginning (A, C) and end (B, D), averaged across 400 trials. As expected, activity ramps up faster 
as dot coherence increases in both the perfectly tuned (A, B) and imprecisely tuned/robust (C, D) 
cases, in accordance with physiological observations. (For example, Roitman and Shadlen (2002).) 
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of stimulus and upon reaching threshold in both model setups. Activity ramps up (integrates) 
faster at higher dot coherence values, consistent with physiological measurements. This provides 
further evidence that our two additional model assumptions are consistent with known physiology 
for circuits involved in perceptual decision making. 
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